3574 results found.
Written
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
5000 synsets Production Status:
Newly created-finished
Use:
Examining the natural selection of words
-
Paper title:WordWars: A Dataset to Examine the Natural Selection of Words
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Saif M. Mohammad | WordWars | /N |
Documentation:
Documentation is available (in English)
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
CreativeCommons
Size:
62000 entries Production Status:
Newly created-finished
Use:
Studying child language, especially poetry
-
Paper title:PoKi: A Large Dataset of Poems by Children
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Saif M. Mohammad | Poems by Kids (PoKi) | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic Azerbaijani Belarusian Bulgarian Catalan Danish English Estonian Filipino Finnish Hindi Hungarian Indonesian Irish Italian Japanese Kazakh Korean Latvian Lithuanian Mongolian Norwegian Polish Portuguese Russian Serbian (Latin) Slovenian Spanish Swedish Tamil Turkish Ukrainian Urdu Uzbek Vietnamese ces deu ell fas fra isl kat mkd nld ron slk sqi zho
Availability:
Freely Available
License:
GNU-GPL v.3
Size:
45 billion words Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Geographically-Balanced Gigaword Corpora for 50 Language Varieties
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jonathan Dunn | GeoWAC | /N |
Documentation:
https://github.com/jonathandunn/earthlings
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
4407648 KByte Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Email Classification Incorporating Social Networks and Thread Structure
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sakhar Alkhereyf | Avocado Research Email Collection | /N |
Documentation:
https://catalog.ldc.upenn.edu/docs/LDC2015T03/
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely available for academic use
License:
Size:
None Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Email Classification Incorporating Social Networks and Thread Structure
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sakhar Alkhereyf | Gender, Power, Business/Personal Type Annotations for the Enron Email Corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely available for academic use
License:
Size:
None Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Email Classification Incorporating Social Networks and Thread Structure
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Sakhar Alkhereyf | Business/Personal Type Annotations for the Avocado Email Corpus | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
1030 word pairs OtherProduction Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:MSD-1030: A Well-built Multi-Sense Evaluation Dataset for Sense Representation Models
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yow-Ting Shiue | Multi-Sense Dataset (MSD-1030) | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English Japanese
Availability:
From Owner
License:
Size:
8748 sentences Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:A Contract Corpus for Recognizing Rights and Obligations
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Ruka Funaki | Contract Corpus for Recognizing Rights and Obligations | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
10000 sentences Production Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
-
Paper title:Recognition of Implicit Geographic Movement in Text
-
Paper track:Written/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Scott Pezanowski | GeoMovement Corpora | /N |
Documentation:
Descriptions of geographic movement in English
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:On the Correlation of Word Embedding Evaluation Metrics
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | François Torregrossa | BLESS | /N |
Documentation:
None




